A polynomial algorithm for balanced clustering via graph partitioning

نویسندگان

چکیده

Abstract The objective of clustering is to discover natural groups in datasets and identify geometrical structures which might reside there, without assuming any prior knowledge on the characteristics data. problem can be seen as detecting inherent separations between a given point set metric space governed by similarity function. pairwise similarities all data objects form weighted graph whose adjacency matrix contains necessary information for process. Consequently, task formulated partitioning problem. In this context, we propose new cluster quality measure uses ratio intra- inter-cluster variance allows us compute optimal under min-max principle polynomial time. Our algorithm applied both partitional hierarchical clustering.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Polynomial Algorithm for Balanced Clustering via Graph Partitioning

The objective of clustering is to discover natural groups in datasets and to identify geometrical structures which might reside there, without assuming any prior knowledge on the characteristics of the data. The problem can be seen as detecting the inherent separations between groups of a given point set in a metric space governed by a similarity function. The pairwise similarities between all ...

متن کامل

Doubly Balanced Connected Graph Partitioning

We introduce and study the Doubly Balanced Connected graph Partitioning (DBCP) problem: Let G=(V,E) be a connected graph with a weight (supply/demand) function p:V→ {−1,+1} satisfying p(V )= ∑ j∈V p(j)=0. The objective is to partition G into (V1, V2) such that G[V1] and G[V2] are connected, |p(V1)|, |p(V2)|≤cp, and max{ |V1| |V2| , |V2| |V1|}≤cs, for some constants cp and cs. When G is 2-connec...

متن کامل

Streaming Balanced Graph Partitioning for Random Graphs

There has been a recent explosion in the size of stored data, partially due to advances in storage technology, and partially due to the growing popularity of cloud-computing and the vast quantities of data generated. This motivates the need for streaming algorithms that can compute approximate solutions without full random access to all of the data. We model the problem of loading a graph onto ...

متن کامل

Bootstrap clustering for graph partitioning

Given a simple undirected weighted or unweighted graph, we try to cluster the vertex set into communities and also to quantify the robustness of these clusters. For that task, we propose a new method, called bootstrap clustering which consists in (i) defining a new clustering algorithm for graphs, (ii) building a set of graphs similar to the initial one, (iii) applying the clustering method to ...

متن کامل

A graph-based clustering method for a large set of sequences using a graph partitioning algorithm.

A graph-based clustering method is proposed to cluster protein sequences into families, which automatically improves clusters of the conventional single linkage clustering method. Our approach formulates sequence clustering problem as a kind of graph partitioning problem in a weighted linkage graph, which vertices correspond to sequences, edges correspond to higher similarities than given thres...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: European Journal of Operational Research

سال: 2021

ISSN: ['1872-6860', '0377-2217']

DOI: https://doi.org/10.1016/j.ejor.2020.07.031